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Ramsey numbers of degenerate graphs 

Choongbum Lee * 


Abstract 

A graph is d-degenerate if all its subgraphs have a vertex of degree at most d. We prove that 
there exists a constant c such that for all natural numbers d and r, every d-degenerate graph 
H of chromatic number r with \V{H)\ > 2^^ ^ has Ramsey number at most 2^^^ \V{H)\. 

This solves a conjecture of Burr and Erdos from 1973. 

1 Introduction 

Ramsey theory studies problems that can be grouped under the common theme that ‘every large 
system contains a highly organized subsystem’. A classical example is the celebrated van der 
Waerden theorem |35] asserting that in every coloring of the natural numbers with a hnite number 
of colors, one can find monochromatic arithmetic progressions of arbitrary hnite length. This has 
motivated further results such as the Hales-Jewett theorem m and Szemeredi’s theorem [M] and 
has had a tremendous inhuence on Combinatorics and related helds. See [20] for a comprehensive 
overview of Ramsey theory. 

For a graph H, the Ramsey number of H, denoted r{H), is dehned as the minimum integer 
n such that in every edge two-coloring of Kn, the complete graph on n vertices, there exists a 
monochromatic copy of H. The name of the held has its origins in a 1930 paper of Frank P. 
Ramsey m, who proved that r{Kt) are hnite for all natural numbers t. In 1935, Erdos and 
Szekeres m rediscovered Ramsey’s theorem, and brought it to more widespread attention by 
hnding interesting geometric applications of its hypergraph extensions. Plenty of variants and 
applications have been found since then, and now it is considered to be one of the most important 
results in combinatorics, lying at the point of intersection of many helds. 

There are many fascinating problems concerning bounds on Ramsey numbers of various graphs. 
Erdos and Szekeres, in the paper mentioned above, established a recurrence relation on the Ramsey 
numbers of complete graphs that implies r{Kt) < for all natural numbers t. 

Later, in 1947, Erdos m, in one of the earliest applications of the probabilistic method, proved 
> 2b/2+c(i))L These two bounds together show that r{Kt) is exponential in terms of the 
number of vertices t. There have been some interesting improvements on these bounds 13132], 
but despite a great amount of effort, the constants in the exponents remain unchanged. See the 
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recent survey paper of Conlon, Fox, and Sudakov m for further information on graph Ramsey 
theory. 

In 1973, Burr and Erdos [3] initiated the study of Ramsey numbers of sparse graphs and 
conjectured that the behavior of Ramsey numbers of sparse graphs should be dramatically different 
from that of complete graphs. A graph is d-degenerate if all its subgraphs contain a vertex of 
degree at most d. Degeneracy is a natural measure of sparseness of graphs as it implies that for 
all subsets of vertices X, there are fewer than (i|A| edges with both endpoints in X. Burr and 
Erdos conjectured that for every natural number d, there exists a constant c = c{d) such that 
every d-degenerate graph H on n vertices satisfies r{H) < cn. This is in striking contrast with 
the case of complete graphs where the dependence on the number of vertices is exponential. 

This conjecture has received much attention and motivated several important developments 
over the past 40 years. For example, the work of Chvatal, Rodl, Szemeredi, and Trotter [5] from 
1983 that we will discuss below is one of the earliest applications of the regularity lemma, and 
fostered further developments such as the blow-up lemma of Komlos, Sarkozy, and Szemeredi 
[23j . Furthermore, progress on the conjecture is intertwined with the development of a powerful 
new tool in probabilistic combinatorics now known as dependent random choice. Kostochka and 
Rodl [23] used its primitive version to study a special case of the conjecture, and later, using a 
different method [25|, established the first polynomial bound r{H) < for all d-degenerate 
graphs H on n vertices (where Cd is a constant depending on d). The general framework of 
applying the dependent random choice technique to embed degenerate graphs was pioneered by 
Kostochka and Sudakov [26], who improved the bound of Kostochka and Rodl to a nearly linear 
bound r{H) < Later, Fox and Sudakov [TS] [H] refined the method to prove 

that r{H) < See the survey paper of Fox and Sudakov [IT] for an overview of the 

history and applications of this fascinating method. 

Linear bounds were established for special cases such as subdivisions of graphs by Alon [T], 
random graphs by Fox and Sudakov [16] , and graphs with small bandwidth by the author [27] . The 
conjecture has also been examined for weaker notions of sparseness. Chvatal, Rodl, Szemeredi, 
and Trotter [5] proved that if 77 is a graph on n vertices of maximum degree at most A, then 
r{H) < c(A)n, where c(A) is a constant depending only on A. Their proof used the regularity 
lemma and thus the dependence of c(A) on A was of tower type. This bound has been improved 
since then, by Eaton [12], Graham, Rodl, and Rucihski |18l [19], and then by Conlon, Fox, and 
Sudakov [9] to r{H) < For bipartite graphs, Conlon [8] and independently Fox and 

Sudakov m proved that a stronger bound r{H) < c^n holds. These results are close to best 
possible since Graham, Rodl, and Rucihski m constructed bipartite graphs H on n vertices of 
maximum degree A satisfying r{H) > c^n (for a different constant c). 

Chen and Schelp [6] considered another measure of sparseness. They dehned a graph to be 
p-arrangeable if there is an ordering ui,--- ,Vn of the vertices such that for any vertex Vi, its 
neighbors to the right of Vi have together at most p neighbors to the left of Vi (including Vi). 
This is a measure of sparseness that lies strictly between degeneracy and bounded maximum 
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degree. They showed that graphs with bounded arrangeability have Ramsey number linear in 
the number of vertices, implying, in particular, that the Burr-Erdos conjecture holds for planar 
graphs. Furthermore, Rodl and Thomas m showed that graphs with no Rip-subdivision have 
arrangeability less than p®, and therefore have Ramsey number linear in the number of vertices. 

In this paper, we build upon these developments and settle the conjecture of Burr and Erdos. 
We say that a graph G is universal for a family of graphs if it contains all graphs F G iF as 
subgraphs. For an edge coloring of a graph, we say that a color is universal for a family F if the 
subgraph induced by the edges of that color is universal for F. 

Theorem 1.1. There exists a constant c sueh that the following holds for every natural number 
d, r, and n satisfying n > For every edge two-coloring of the complete graph on at least 

2 d 2 ‘^'^n vertices, one of the colors is universal for the family of d-degenerate r-colorable graphs on 
at most n vertices. 

This settles the conjecture of Burr and Erdos since all d-degenerate graphs have chromatic 
number at most d + 1. Moreover, for fixed values of r. Theorem 11.11 is best possible up to the 
constant in the exponent. To see this, consider a random graph G on (1 — e)2‘^n vertices of density 
^ and let FI be the complete bipartite graph Kd^n-d with d vertices in one part and n — d vertices 
in the other part. It is well-known that in G, with high probability, every d-tuple of (distinct) 
vertices has fewer than (1 — |)n common neighbors. Therefore, G does not contain a copy of 
H. Since the complement of G can also be seen as a random graph of density we see that 
the complement of G also does not contain a copy of H. Another way to see the tightness of 
Theorem 11.11 is by considering the construction of Graham, Rodl, and Rucihski |19] mentioned 
above. 

The density of a graph is defined as the fraction of pairs of distinct vertices that form an 
edge. Most of the previous results mentioned above in fact provide a density-embedding result for 
bipartite graphs, saying that every dense enough graph contains a copy. Note that the Ramsey 
number result follows from such a density-embedding result since in every edge two-coloring of 
a complete graph, one of the colors must have density at least In this context, the following 
theorem generalizes Theorem 11.11 to a density-embedding result. 

Theorem 1.2. There exists a constant c such that the following holds for every natural number d 
and real number a < ^. For every natural number n > , if G is a graph with at least 

vertices and density at least a, then it is universal for the family of d-degenerate bipartite graphs 
on n vertices. 

Note that the complete bipartite graph Kd^n-d mentioned above has a specific structure. 
Namely, every vertex on one side has bounded degree. Until now, even this special case, bipartite 
graphs with one side of bounded degree, of the conjecture was open. A result of Alon [T] implies 
the case when the degrees are bounded by two. The corresponding density-embedding result was 
proved by Fox and Sudakov m, who suggested the general case as an interesting problem to 
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examine. The theorem below addresses this special case of the conjecture and shows that we can 
improve Theorem 11.21 to get nearly best possible embedding results. 

Theorem 1.3. Let d be a natural number and a,e be positive real numbers satisfying < 

e < 1. Let G be a graph on (1 + e)a~'^n vertices of density at least a. Then G is universal for 
the family of bipartite graphs H on n vertices with a vertex partition Wi U W 2 where all vertices 

iTjT/ id 

in Wi have at most d neighbors in W 2 , and |W 2 |(|PV 2 |-i)- ■(|H^ 2 |-rf+i) — 1 

This theorem strengthens previous density-embedding results for bipartite graphs with bounded 
maximum degree um, and is close to being best possible as can be seen by the example where 
G is a random graph and H is the complete bipartite graph Kd^n-d discussed above. Let Qn be 
the hypercube, which is the graph with vertex set {0,1}” where two vertices are adjacent if and 
only if they differ in exactly one coordinate. Theorem 11.31 with e = ^ and a = ^ shows that 
'I'iQn) Li 2^”’ + n^2”’ holds for all sufficiently large n. This bound improves by a constant factor, 
the current best known bound r{Qn) < of Conlon, Fox, and Sudakov [TT], proved using a 

different approach based on the local lemma. It is conjectured [3] that there exists a constant c 
such that r{Qn) < c2” for all n. 

The proofs of the three theorems above are based on dependent random choice and build 
upon several ideas developed for earlier applications. While the proofs of Theorems 11.11 and 11.21 
are technically involved, Theorem II.31 has a short proof that highlights one of the main differences 
in our usage of this technique as compared to the previous ones. We thus start by presenting 
the proof of Theorem 11.31 in Section [2j In Section [3l we introduce some central concepts and 
notation and give a brief outline of the proofs of our main theorems. In Section [H we develop 
the embedding strategy that will be used. The proofs of Theorems 11.11 and 11.21 will be given in 
Section [5] except for one key lemma, which will be proved in Section [6l We conclude with some 
remarks in Section [71 

Notation. For an integer m, define [m] := {1, 2,... , m} and for a pair of integers mi, m 2 , define 
[mi, m 2 ] ;= {mi, • • • ,m 2 }, [mi, m 2 ) ;= {mi, • • • ,m 2 - 1}, (mi, m 2 ] := {mi -|- 1, • • • ,m 2 }, and 
(mi, m 2 ) := {mi + 1, • • • , m 2 — 1}. For a set of elements X, we define = X x ■ ■ ■ x X as the 
set of all t-tuples in X. Let Q be a d-tuple and Q' be a d'-tuple of elements in some set. We use 
QUQ' to denote the (d -|- d')-tuple obtained by concatenating Q' to the end of Q. 

Let G = {V, E) be an n-vertex graph. For a vertex x and a set T, we define deg(x; T) as 
the number of neighbors of x in T, and define deg(x) := deg(x;I/). For a set or an ordered 
tuple of vertices Q, define N{Q]T) := {x £ T : {x,y} £ E,\/y £ Q} as the set of common 
neighbors in T of vertices in Q. Dehne N{Q) := N{Q;V). For two sets X and Y, dehne 
E{X,Y) = {{x,y) £ X xY : {x,y} £ E} and e{X,Y) = \E{X,Y)\. Furthermore, define 
e{X) = ^e{X,X) as the number of edges in X. Let d{X,Y) = density of edges 

between X and Y. For a graph H, an embedding of dd in G is an injective map / : V{H) —)• V{G) 
for which {f{v),f{w)} £ E{G) whenever {v,w} £ E{H). A partial embedding of H in G defined 
on V C V{H) is an embedding of H\y'] into G. For V" C V{H) an extension to V" of a partial 
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embedding / on V is an embedding g : H\y' V] —)■ G such that g\v' = /■ We often abuse 
notation and denote the extended map using the same notation /. 

Throughout the paper, we will be using subscripts such as in qpTl indicate that x is the 
constant coming from Theorem/Lemma/Proposition 1.1. 


2 One-side bounded bipartite graphs 


Fox and Sudakov m asked if the Burr-Erdos conjecture holds for bipartite graphs where all 
vertices in one part have their degree bounded by d. In this section, we answer their question by 
establishing an embedding theorem for such graphs. 


Definition 2.1. Let G be a bipartite graph with vertex partition Vi U V 2 - For a positive real 
number 9 and an ordered d-tuple Q € V 2 , we define the ^-defect of Q as 


^e{Q) 


0 if\N{Q)\>e 

otherwise. 


We simply write uj{Q) when 9 is clear from the context. 


For a given ordered d-tuple of vertices Q, the 0-defect ujg{Q) dehned above captures how the 
number of common neighbors of Q compares to some prescribed threshold 9. We give a penalty 
to the d-tuples having only a small number of common neighbors. Informally, if a set has small 
average 0-defect over d-tuples, then most d-tuples in it will have at least 0 common neighbors. 
This weight function has been considered before by Alon, Krivelevich, and Sudakov [2] with a 
fixed value of the threshold 0. The following lemma, based on dependent random choice, shows 
that we can find a set in which the average defect is small. 


Lemma 2.2. Let d,t be natural numbers, s be a non-negative integer satisfying t > s, and g,£ 
be positive real numbers satisfying e < 1. Let G be a bipartite graph of density at least a with 
vertex partition Ei U V 2 - Then there exists a set A <ZV 2 of size at least |A| > such that 

j^T.QeAd^eiQY < for all 9 < ga'^\Vi\. 

Proof. Since iog is increasing in 0, it suffices to prove the lemma when 0 = r]a'^\Vi\. Throughout 
the proof we will use this fixed value of 0, and use the notation ujg without the subscript. Let 
X E Vf be a t-tuple of vertices chosen uniformly at random (where we allow repetition of vertices), 
and define A = A(X). Note that 



By convexity, we have E Fix a d-tuple Q E V 2 let |A(Q)| 


70 . Since 
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if 7 < 1, then io{QY • P (Q € A'^) = 7 ^ (we used the fact that t > s). On the 

other hand, if 7 > 1 , then uj{Q) = 0 by definition. Hence, 


E 


Y1 

QeAd 


= coiQy • P (g € A'^) < \V2Y • 


Therefore, 


E 


lAl"- 


1 — e 
r/* 


E 

Q&Ad 


coiQY 


>£«'"* IP 21 


Choose X so that the random variable on the left-hand side of the inequality above becomes at 
least as large as its expectation and let A be A for this choice of vertices. Then \ AY > 


and thus |A| > £^/'^a^\V 2 \ ■ The other claim follows from \AY — — 0- 


□ 


We now prove a density-embedding theorem for bipartite graphs with one part having bounded 
maximum degree. Theorem 11.31 follows from the theorem below by taking e > since it 

implies (1 -|- £)a~'^ > and in every graph on n = ni -|- n 2 vertices of density a, we 

can find a bipartite subgraph with ni and n 2 vertices in each part, having density at least a. 


Theorem 2.3. Let £ be a positive real number. Let H be a bipartite graph on n vertices with a 
vertex partition Wi U Vk 2 where all vertices in Wi have at most d neighbors in W 2 . Let G be a 
bipartite graph of density at least a with vertex partition Vi U V 2 where |Vi| > (1 -|-e)a“'^|VPi| and 

IP 2 I > If iw 2 \(\W 2 \-Z^Zw 2 \-d+i) ^1 + ^’ G contains a copy of H. 

Proof. Let 9 = \Wi\ and note that 9 < Throughout the proof we will use oj without 

subscript with the understanding that oj = oj 0 . Apply Lemma 12.21 with ^ 2 ^ = 2, = 1, 

= d, r(22] = T^, and 3^ = to hnd a set A C V 2 of size |A| > (p^)^/'^a^|V 2 | > IW 2 I 
for which J^QfzAd^iQ) < (p^)^= p^l^l'^- By adding edges if necessary, we may 
assume that all vertices in ITi have exactly d neighbors in IT 2 . Let (f be an injective map from 
kL 2 to A chosen uniformly at random. For each vertex v G ITi, let be an ordered d-tuple of 
vertices obtained from N{v) by arbitrarily ordering the vertices. Note that 


E 


v£W\ 


= Y ^HHev))]= Y E = Q) 

vGWi vGWi QgA'* 


<|Wi|- 

QeA<i 


_^(g)_ 

|A|(|A|-l)...(|A|-d+l)- 


Since 'Eq^A-i^iQ) ^ 1^1 ^ l^ 2 |, 


E 


Y w(<('(eg) 

v£Wi 


< IkFil • 


1 |W 2 |'^ 

l + e|lT2|(|W2|-l)---(|W2|-d+l) 


< |kFi|. 
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Therefore, there exists a particular choice of </> such that Ylv&Wi < l^il- Note that (j) is 

trivially a partial embedding of H in G defined on W 2 - We will now extend cj) to Wi. Order the 
vertices in Wi as ui, • • • ,Vni in decreasing order of (where ties are broken arbitrarily). 

We will greedily embed the vertices of Wi following this order. Suppose that we extended (p to 
{ui,--- and for simplicity dehne e* = e„.. First assume that ^ 0. Note that 

i ■u:{p{ei)) < - l^il therefore u:{p{ei)) < Hence, 




uj{(j){ei)) 


> iVFil- 


|Wi| 


= 


Since we have so far embedded at most i — 1 vertices of VFi, we can define (p{vi) as a vertex in 
N{(f>{ei)) \ {(p{vi), • • • , p{vi-i)}. On the other hand, if a;((/>(ej)) = 0, then \N{(j){ei))\ > 9 > \Wi\ 
and therefore we trivially have N{(p{ei))\{4>{vi), • • • , (p{vi-i)} 7 ^ 0 and can dehne 4 >{vi) as a vertex 
in this set. Thus, we can hnd an embedding of H in G. □ 


3 Preliminaries 

3.1 Decomposing H 

We start with the following simple lemma that decomposes a given degenerate graph into man¬ 
ageable pieces. 

Lemma 3.1. Let H be an r-colorable d-degenerate graph on n vertices. Then there exist a natural 
number k and disjoint subsets with the following properties: 

(i) k < logan, 

(a) for all (z, j) G [k] x [r], 

(Hi) for all j € [r], the set Uig[fe] is an independent set, and 

(iv) for all {i,j) G [k] x [r], each vertex in has at most Ad neighbors in Ui'>i j'e[r] wiP. 

Proof. Let Ui = V{H). For each z > 1, dehne C/j+i C Ui as the set of vertices having degree 
at least Ad in the subgraph H[Ui]. Since H is d-degenerate, there are at most d\Ui\ edges in the 
subgraph H[Ui]. Therefore, we have 4d|C/j_|_i| < 2d\Ui\, forcing |17i+i| < ^\Ui\. Since H has n 
vertices, we have \Ui\ < for each z > 1, and the process continues for k < log 2 n steps. 

Dehne Wi = Ui\ C/j+i for all i G [/c]. Consider a proper r-coloring of H using [r], and for each 
j G [r], dehne as the set of vertices of color j in IFj. One can easily check that all the 

conditions hold. □ 

3.2 Defect and average defect 

We will work with the decomposition given by Lemma 13.11 The main objective is to hnd a 
partition of the vertex set of the host graph G so that we can embed to piece 
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by piece. As in the previous section, the existence of such an embedding depends on the ‘average 
defect’ of this partition. In this subsection, we introduce the central concepts and notation related 
to the defect function. We slightly generalize Definition 12.11 so that we consider the number of 
common neighbors in a specific set. 

Definition 3.2. Let G be a graph. For a positive real number 0, a set T C V{G), and an ordered 
tuple Q of vertices, we define the 0-defect of Q in T as 


T) 


0 if\N{Q;T)\>9 

otherwise. 


We may simply write a;(Q; T) when 6 is clear from the context. 


One can easily check that monotonicity u! 0 {Q-,T) < uj 0 '{Q']T') holds for all 6 < 6 ', Q C Q' 
(as sets), and T D T'. Note that in the previous subsection, we utilized Lemma 12.21 only for the 
case s = 1, even though Lemma [2. 21 provided a variety of bounds. Prom now on, it will be crucial 
to consider other values of s. 


Definition 3.3. Let G be a graph and d, s be natural numbers. For a positive real number 9, a 
set T C V{G), and a set Q C V(G)'^, we define the s-th moment of the 0-defect of Q in T as 

Q&Q 

We may simply write ps{Q]T) when 9 is clear from the context. We extend the definition to s = 0 
by defining ui 0 {Q]Ty to be 0 when uj 0 {Q;T) = 0, and 1 otherwise. 

The monotonicity of the defect function iw and the fact that it equals either 0 or a real number 
at least 1 implies ^s,d{Q]T) < for s < s', 9 < 0', and T D T'. Note that the 

conclusion of Lemma [2.21 can be re-written as an upper bound on Most previous 

applications of dependent random choice were proved by controlling the 0-th moment of the 0- 
defect function, which equals the probability that a random uniform d-tuple in Q has fewer than 
0 common neighbors, whereas here we will be considering higher moments. We will focus on the 
cases when Q = Ai x ■ ■ ■ x for some (not necessarily distinct) sets Aj. The following proposition 
provides a useful relation between average defects of such product sets. 

Proposition 3.4. // Ai, • • ■ , Ad, T are vertex subsets, then ^s,6»(nf=i ^ hs,e(Wi=i Ai;T). 

Proof. Since u] 0 {Q;T) < lv0{Q'-,T) for all Q C Q', 

E MQxr< E E ““WuMirr) =i^ E ^niQ-rr- 

Q^ntiW Qent/AiV J Qenti^i 

Divide both sides of the inequality by Oti 1^*1 to obtain ^) < Ms,e(nf=i ^i]T). □ 




The following proposition shows that the contribution of d-tuples with repeated vertices to¬ 
wards the average defect is of small magnitude. 

Proposition 3.5. Let Ai, ■ ■ ■ ,A(i be vertex subsets all of sizes at least m, and let T be a vertex 
subset. Let dQ C Hieid] 0 / d-tuples (ui, • • ■ , Vd) for which Vi = Vj for some distinct 

i,j € [d]. Then ' 

Proof. For each distinct a,b £ [d], define Qa,b as the set of d-tuples in dQ whose a-th and 
6-th coordinates coincide. Note that if Q = (ui,-- - ,Vd) and Vd-i = Vd, then uJo{Q‘,T) = 
,'i’d-i)]T). Therefore, by Proposition [SiH 

^ ue{Q-,Ty= Yl ^e{Q-,Tr<T^. Y ^e{Q-,Ty. 

QeQd-i,d Q&YltiAi Q&YltiAi 

Since \Ai\ > m for all i £ [d], the conclusion follows by summing the inequalities over all pairs 
a, 6. □ 

The next proposition asserts that for a pair of sets Vi and V 2 , if the s-th moment of the average 
defect of Vf into V 2 is small, then all d-tuples in Vf have many common neighbors in V 2 . 

Proposition 3.6. Let d,s, and 9 be natural numbers. For all sets of vertices Vi and V 2 , the 
number of d-tuples Q £ Vi satisfying |A^((5; V 2 )| < jv^d/7 is less than iJLs,e{Vi]V 2 ). 

Proof. If a d-tuple Q £ Vf satisfies \N{Q; V 2 )\ < then by definition, we have ove{Q] 1/2)^ > 

iPil'^. Therefore, the number of such d-tuples is less than pspiVi', F2)- D 

We will use Proposition 13.61 mostly in the case when pLsp{Vi]V 2 ) < 1. For such cases, all 
d-tuples Q £ Vf satisfy |iV(g; ^ 2 )! > 

3.3 Outline of proof 

Let dd be a d-degenerate graph and G be a host graph that we would like to embed H into. Let 
{IFi}jg[fc] be the partition of the vertex set of H obtained from Lemma [3T] and suppose that we 
have disjoint vertex subsets {Vi}ig[A:] of G. We will embed Wi in Vj one vertex at a time, starting 
at i = k and proceeding in decreasing order of index. Suppose that we have successfully embedded 
Wj+i in Vj+i (and all previous steps) and are about to embed Wi in Vj. Let (f> denote the partial 
embedding that we have at that moment. For each vertex v £ Wi, define N~^(v) = dV(u)n|Jj^j Wj. 
For simplicity, we assume that |A^'’'(u)| = d for all v £ V{H). We can force the extension of (f to 
Wi to be a homomorphism by embedding each vertex v £ Wi to a common neighbor of vertices 
in <f>{N~^{v)). To obtain an embedding, we must additionally guarantee that (p is injective. Let 
be an arbitrary ordering of the tuple N^{v). If the map constructed at the previous steps were 
well chosen so that the sum 

Y ^e{(t>iev)-,Vi) ( 1 ) 

veWi 
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is small, then we can proceed as in the proof of Theorem 12.31 to extend the map to VFj. 

To briefly describe how to obtain control on ([T]), we consider the case when all vertices in 

V € Wi satisfy N~^{v) C Wi+i. In this case, if the embedding 4>\wi+i was chosen as a uniform map 

from Wi+i to Vi+i, then we would expect the sum above to be Hence, we can 

make ([H) small by finding sets having small average defects in an appropriately defined 

way. However, even if we are given such subsets {Hjjjgjfc], our assumption that 4>\wi+i is a uniform 
random map cannot be true. Nevertheless, we show that ([I]) is not too far from Hj) by 

showing that our map is not too far from the random uniform map. For example, consider the 
simple case discussed above where all vertices in u € Wt satisfy N~^{v) C IFj+i. For a fixed vertex 

V € Wi, let u;i, ■■■ ,Wd he its neighbors in N~^{v) and let = {wi, ■ ■ ■ ,Wd)- If the embedding 

was chosen uniformly at random, then the image of Wj would be chosen in the set V^+i, but our 
algorithm chooses its image in N{(f)[ew.)] Vi+i) instead. Hence, compared to the random uniform 
map, the expected value of Fi+i) could be larger by at most a factor Ojelii] |A''( 0 (!r“Ty+i) | • 

Note that either \N{^{ey,.)]Vi+i)\ > 6 oi ); Fj+i). Therefore, by 

the inequality of arithmetic and geometric means, 


n 

ie[d] 


\N{4>{e^^)-Vi+i)\ 


1 

< - 

- d 



\V^+l\ 



0 '^ 


(^1 + oj0{(j){ewj)',Vi+i)'^'^ . 


The second term in the summand in expectation has value ^*+i) this quan¬ 

tity is less than 1 and everything worked as planned, then the right-hand side would be at 

most 2 . This means that in expectation, the sum ([1]) would be at most 2 • 

p-i,e{V^^i;Vi). In general, if N~^{v) % kFj+i, then there will be more dependencies between the 
random variables and it gets trickier to control the events. However, the underlying idea, com¬ 
paring our algorithm with the uniform random map and measuring its deviation using higher 
moments of the average defect function, remains the same. We will formalize this idea in Sec¬ 
tion [H 


4 Embedding scheme 

In this section, we develop the embedding scheme. The main new ingredient of our embedding 
algorithm compared to previous ones lies in the randomness. We recall the proof of Theorem l2.3l to 
illustrate the essence of this difference. There, we embedded a graph H with bipartition Wi U H 2 
into a graph V with bipartition Hi U I^. We started by finding a set H C V 2 with small defect 
into Hi using dependent random choice. This step is identical to that from previous applications 
(except that we imposed a weaker bound on the defect function). In the second step, we found a 
map (p from IH 2 to V 2 . The main difference lies in this step. In previous applications, it was done 
greedily so that for all v € IHi, the set (j){N{v)) has many common neighbors. For our purpose, 
the weaker bound from the first step makes this strategy not viable. Therefore, we took a random 
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map from W 2 to V 2 and used the first moment to find a map for which the number of common 
neighbors is controlled by fi(A'^;Vi). Finally, we embedded Wi in Vi so that each vertex v G Wi 
gets mapped to a vertex in the common neighbor of (p{N{v)). The difficulty in extending this 
algorithm to non-bipartite graphs lies in the fact that there is no clear distinction between the 
second and third steps. That is, when mapping a vertex v G Wi to the common neighbor of 
(p{N(v)), we must retain randomness so that the process can be continued. 

Let G be a graph with disjoint vertex subsets and iL be a graph with a vertex 

partition {VFjjjgjfc] into independent sets where each vertex in Wi has at most d neighbors in 
Wi+i U • • • U Wk for all i G [fe — 1]. Let be natural numbers. For i G [fe — 1] and a vertex 

X G Wi, define N+{x) = N{x) n Uje[i+i,fc] Wj. Add edges to H if necessary so that |A'^(x)| = d 
for all vertices x G V{H) \ Wk- For i G [k\ and x G Wi, let Cx be an (arbitrary) ordered d-tuple 
of vertices formed from N^{x). Define a random map if: : V{H) — V{G) using the following 
‘random greedy’ process 

1. Take an injection from Wk to Vk uniformly at random. 

2. For i G [k — 1], given a map V’ defined on Wj+i U • • • U Wk, we then extend ■0 to Wj. Let 
xi,X 2 , ■ ■ ■ ,Xm be the vertices in Wi ordered so that uJg^{^l>{exj);Vi) is decreasing in j (ties 
are broken arbitrarily). 

3. After embedding xi, ■ ■ ■ ,Xj-i, define Cj = Cx^ and Lj = Vi)\{0(xi), • • • ,'0(xj_i)} 

as the set of available vertices for Xj. Embed Xj using the first valid option among the 
following; 

3-1. If N{'ip{ej)-, Vi) = 0, then let '>p{xj) be a vertex in Vi chosen uniformly at random. 

3-2. If \Lj\ < i|A'('0(ej); Vi)|, then let V’(xj) be a vertex in N{ip{ej);Vi) chosen uniformly 
at random. 

3-3. If \Lj\ > ^|A'(0(ej); V))|, then let 0(xj) be a vertex in Lj chosen uniformly at random. 

If we only run Steps 3-2 and 3-3 (but never Step 3-1) throughout the embedding process then 
the resulting map is a homomorphism from H to G. As we will later see, Step 3-1 can easily 
be ruled out if we have control on the defect function so one can safely assume that tp is always 
an homomorphism from H to G. Given that 0 is a homomorphism, it suffices to show that 0 is 
injective, in order to show that it is an embeding. Thus, it will be crucial to understand when we 
will run Step 3-3 instead of Step 3-2. 

Throughout the proof we consider the defect function ujq{Q-,T) for various different choices of 
9, Q, and T. Since these parameters will always be a function of the vertex x G V{H) that we are 
about to embed, we make the following definitions to remove redundant information and simplify 
notation. 

Definition 4.1. Suppose that we are given {110}ie[fc]; and For i G [/c — 1] and 

X G Wi, make the following definitions: 

(i) 9x '■= 9i and I 4 := ki¬ 


ll 



(a) Lo{x\'il)) := 

(Hi) Qx '■=Vi^ X • • • X Vi^ where Wi^ x • • • x Wi^ is the unique product space containing ex- 

(iv) fj,s{x) := tJ-s,e^{Qx] Vx) and Hs := max^(zv(H) hsix). 

(v) 7 := max|l,maxig[fc_i] 

For X G V{H), if ps{x) is finite, then there are no d-tuples Q & Qx having N{Q]Vx) = 0. 
Hence, we will never run Step 3-1 in such case. For a vertex x G Wi, the parameter 9x = Oi is 
the defect that is of interest when embedding x in V^, as whether we run Step 3-2 or 3-3 depends 
on \N{'il){ex)'-, 14)1 which is closely related to a;(x; V') = oj 0 ^{'tlj{ex); 14). Note that the enumeration 
of vertices in Step 2 is determined by the values u}{x',ip). Further note that if the embedding 
algorithm chose the image of each vertex y Cx as a random uniform vertex in Vy, then p-six) 
would be the expected value of u}{x','ipY. This is an important observation, since we will later see 
that even though 4 does not always choose a random uniform vertex, 7 can be used to bound 
the difference between the expected value of Lo{x]Yy defined by our random process and by the 
random uniform case. The following theorem establishes a sufficient condition for 4 to be an 
embedding. 

Theorem 4.2. If |14| > 2|H4| and 6i > 2|VFj| for all i G [/c — 1], then the probability that if does 
not induce an embedding of H in G is at most 

The rest of this section focuses on proving Theorem 14.21 The proof of Theorem 11.11 then 
follows by finding subsets of vertices of G satisfying the condition of Theorem 14.21 

4.1 Proof of Theorem 14.21 

For technical reasons, we consider auxiliary graphs. Add a set H4+i of d vertices to H, make 
the bipartite graph between H4 and Wk+i complete, and between Wi and Wk+i empty for all 
i G [k — 1]. Denote the resulting graph as H'. Define N~^{x) := N{x) D Wk+i for each x G Wk- 
Note that now, for all x G V{H), we have |A'+(x)| = d. Let G' be a graph obtained from G 
by adding a set I 4+1 of d vertices adjacent to all other vertices. Define Ok = |14| and note that 
(jj 0 ^.{Q;Vk) = 0 for all Q G Further note that for all x G Wk, we have uj 0 ^,{x-,if) = 0 and 

Ps{x) = 0 regardless of the choice of if and s since G Note that Definition 14.11 now 

extends to i = A; as well. The parameters for i < k — 1 do not change depending on whether 
we consider the graphs H,G, or the graphs H',G'. Furthermore, 7 is identical when defined for 
the pair G or in G', since = 1. Thus we abuse notation and use the parameters dehned in 
Definition 14.11 as if they were defined for H',G'. 

Throughout this section, we will apply the embedding scheme defined above to embed H' to 
G', but with a slightly modified first step: 

1’. Take a map from Wk+i to 14_|_i uniformly at random (instead of a random injection). 


12 




We then embed the rest of the graph using the same algorithm (where we also consider i = A: in 
Steps 2 and 3). Note that restricted to the vertices V{H) has the same distribution as the map 
previously defined without the sets Wk+i and I 4 + 1 . 

The following lemma gives a sufficient condition for ^ to be an embedding. 


Lemma 4.3. Let G, G', and xjj be as described above. Suppose that 

di > 2|lTj| for all i G [/c]. For a natural number s, let cj) : V{H') —>■ l^(G') be a map satisfying 
P(V’ = <^) > 0 and YlxeWi ^ ^ f’WiH) is an embedding of H in G. 


Proof. As discussed in Section^ we have YlxeiWi 4>) < YlxeWi therefore it suffices 

to consider the case s = 1. Let (f : V{H') V{G') be a map satisfying 

for all i € [k]. Fix i G [k] and condition on the event that = cj) after mapping the vertices 
Wj+i, • • • , Wk- Let xi, X2, • • • ) Xm be the vertices in Wi in decreasing order of uj{xj] <f) and define 
Cj = ^xj for each j G [m]. Note that this is the order that the vertices in Wi will be mapped to 
Vi. Consider the j-th step. Since hnite, we have N{(j){ej)]Vi) 7 ^ 0- Hence, we 

will not run Step 3-1 when mapping Vj. 

If uj{xj\4>) = 0, then \N{(j){ej)-,Vi)\ > Oi > 2\Wi\ and thus we determine f^{xj) according to 
Step 3-3. If u!{xj;(j)) ^ 0, then by how we ordered the vertices, we have j ■ oj{xj\(fi) < ^0i, and 
thus 




from which it follows that \N{(f(ej)',Vi)\ >2j. Since we embedded at most j — 1 vertices of Wi prior 
to Xj, we determine ifixj) according to Step 3-3 when embedding Xj. Since P{^p{xj) = 4>{xj)) > 0 
(conditioned omp = f for all previous vertices), we see that 4>{xj) is in N{(j){ej); V) and is distinct 
from all previously mapped vertices. Since the analysis applies to all steps of the embedding, we 
can conclude that 4>\v{h) is injective, and thus an embedding. □ 


Lemma [4.31 shows that it is crucial to control the quantity V')*- Not® fbat the 

expected value of Lo{x;'tpY is fis{x) if Yi^x) were uniformly distributed in Q^. This was the case 
in the proof of Theorem 12.31 and there, thus we were able to conclude that i® 

small quite straightforwardly. Now our situation is more complicated, since ipi^x) is no longer 
uniformly distributed in Qx. We gain control on the sum by comparing our distribution with the 
uniform distribution. 

Let v be the distribution on the set of maps p : V{H') —>■ V{G') obtained by the random 
greedy embedding algorithm defined above. For a set of vertices I C V{H), we say that is a 
probability distribution obtained from v by neutralizing I if in the random greedy process above, 
every time we embed a vertex x G /, instead of following Steps 3-1, 3-2, or 3-3, we choose the 
image of x as a uniform random vertex in I 4 . For example, if we neutralize all vertices, then 
the resulting distribution is a uniform distribution over all maps cj) : V{H') —>■ V{G') satisfying 
(piWi) C Vi for all i G [A: + 1]. 
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From now on, we use V' to denote a random map from V{H') to V{G') whose distribution is 
determined by the probability measure under consideration. In contrast, we use (j) to denote fixed 
(non-random) maps. Recall that 7 = max|l,maxjg[^] ^|. 

Lemma 4.4. Let R C V{H) be a subset of vertices and let vi he the distribution obtained from 
V by neutralizing Ii. Let X be a non-negative random variable depending only on the images of 
vertices in J for some J C V{H). Define I 2 = Ii U J and define 1^2 as the distribution obtained 
from V by neutralizing l 2 - Suppose that t = | J \ /i| > 1. Then 

E,JX]<2VE,,[X]+22*-V%,[x2] + 1 ^ 

yeJ\h 

Therefore, if X‘^ > X holds with probability 1, then 

E,JX] < 22V‘E,JX2] + 1 ^ E,Ju;(y;V’)'‘]. 

y&Ah 


Note that the first term on the right-hand side, under the probability measure 1 ^ 2 , is 

determined completely by a set of vertices whose images are chosen uniformly at random. Hence, 
this lemma allows us to compare the distribution on maps defined by our random greedy algorithm 
with the random uniform map. 

Proof. For each vertex y € AX h, dehne Cy{4>) = ) | • For a set IF C V{H') of size 

|IF| = t, fix a map ^ : IF —)■ F(G'). We use the notation 'ift = (f> to indicate the event that the 
random map '0 obtained after embedding the first t vertices is 0. Define Iw = A \ {A U IF). Let 
izw be the distribution obtained from ni by neutralizing R U Iw- We prove that for all IF, t and 
0 as above. 



At = A 

A: 

X- n Cy{A) 

At = A 




y&iw 



We prove ([2]) by (downward) induction on t. If t = \V{H)\, then Iw = 0 and so the random 
variables on both sides of the inequality equal X. Therefore, ([2|) trivially holds. 

Let us now investigate the value t while assuming that ([2]) is true for all larger values. Let IF 
be a set of size t and 0 be a map defined on IF. Conditioned on xft = 0) we know which vertex 
will be embedded next. Say that it is xt+i £ IF* for some i € [A:]. Define IF' = IF U {xt+i}. For 
each z E F, let 02 be the extension of 0 obtained by defining 0(xt+i) = z. Then 


E, 




At = 


/ ^ Pi'i idPt+i — Az 

zeVi 


I = 0 ) E,, 




At-hi 
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Therefore, by the inductive hypothesis, we have 


E,,j X ^t = (l> < ^ 'Pui (V’t-hi = 4 >z\i^t = (j)) E,.^, 

X- n Cy{i^) 

1 

II 

+ 

z&Vi 

r 

y(ziw' 

J 

II 

M 

+ 

II 

o 

II 

X- n c',(V’) 

5. 

+ 

II 

0 

Z&V^ 

y&iw' 

J 


where the second equality follows since the distribution of vyi// and vyy differ only on the image 
of xt+i which is fixed once we condition on = </> 2 - 

If Iw' = Iw, then xt+i ^ h, and thus VuAi’t+i = = </>) = 'Puwi'^Pt+i = = </>)• 

Therefore, the right-hand side above is 


X] (V’t-hi = (t>z\'4’t = 4>) 

zeVi 


X • Yl Cyii^) 

+ 

II 

_1 

— ^l/w 

X • n Cyii^) 

1pt = (p 

y&lw 

J 


y&iw 



proving our claim. On the other hand if lyy' ^ Iw-, then lyy/ = lyy U {xt+i) and xt+i G J \Ii- 
Note that the image of Xt+i, under the probability distribution ui, will be chosen uniformly in a 
set of size \Vi\ (Case 3-1), |At((/)(ea;j_^^); V))| (Case 3-2), or at least (Case 3-3). Since 

= <(>2 I V't = <(>) = 1 ^, it follows that 

2 

Pi.1 (V’t-hi = </>21 V’t = <(>) < = C'xt+i(</>) • = (t)z\i’t = 4>)- 


Therefore, 


E,y^ X 1pt = 4) < ^ P,/l {'Ipt+l = 4>z\'4’t = 4>) Eu^y 

^ • n ^y(^) 1 V’t+1 = 4>z 


z&Vi 

1 y^iw 

- 


- ^^t + l{4>) ■ 'Ptzyy {i’t+l =4>z\'4’t 

= 4>) 

X- n Cyii;) 

+ 

II 

0 

z&Vi 


y&iw' 

J 


2|y I / \ '^ 

Since C',r,+i(V’) = ^^t+i ^ (Ui>i+i Wj) , the value of C',r,+i(V') is determined 

once we condition on V’t+i = 4>z- Since Iw = Iw' U {uj+i}, we have 


E 




X 


i^t = 4> < X] = ()>21 V’t = 0) E 

z&Vi 


Uw 


X ■ n c,(v^) V’t+ 

y&iw 


1 = (Pz 


= E, 




X- n Cy{^) 

y&lw 


i^t = 4> 
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This proves the claim. 

The inequality ([2]) for t = 0 gives < Ejyj X ■ nj/e/ 2 \h • ^o^e that Cylip) = 

^ {4- Therefore, if I /2 \ Al = f, then 


n C,{</.)< 2 ‘ n l^rll 


/ max{l,a;(y;V')} 


y&l2\Ii 


y&hXh 


0„ 


< 


JJ ^ (max{l,a;(y;V')})* < 2*7^^ Y (^ +V')*)- 

\ y&l2\li y J yehXh yehVi 


Hence, for C = 2 * 7 *, 

E,^[X]<E,,[CX] + ^ E,, [CX.a;(y;V^)‘] 

yehXh 

< E,3 [CX] + 7 j; ^ (E., [^2^2 + a;(y; ^|;)^^]) 
y&l 2 \h 

<E,^[CX] + \e,,[C^X^]+^^ Y, E,Ju;(y;V'f]. 


□ 


y&l2\h 


We plan to gain control on the defects co^x^tp) by using Lemma l4~4l As explained above, the 
first term in the right-hand side of Lemma 14.41 gives a direct comparison between our process 
and the random uniform map. However, the second term in the right-hand side of Lemma 14.41 
is still problematic since it is in general determined by vertices that are not yet neutralized. 
We repeatedly apply Lemma 14.41 to further gain control on these terms. As we proceed, the 
set of neutralized vertices further propagates and eventually there will be no vertices left to be 
neutralized. 


Lemma 4.5. For all x € V{H), E [u;(x; 

Proof. For a vertex x € Wj, let To be the vertex-weighted rooted tree labelled by vertices in V(H) 
with a single root vertex labelled x having weight 1. For j > 0, suppose that we constructed 
a weighted rooted labelled tree Tj, and let Lj = V{Tj) \ H(Tj_i). For a node a € Lj, define 
ya G y{H) as the label of a, and P{a) as the set of vertices on the path from the root to the 
parent node of a in the tree Tj. Let I (a) = W^+i U|J^gp(-^) N~^{yif) and let F{a) = N^{ya) \ I{a). 
For each vertex y € T(o), add a child to a labelled y and let its weight be 2 ]f^’ where a{a) is 
the weight of a in Tj. Let T^+i be the tree obtained by doing this process for all a £ Lj. Note 
that the process eventually stops since each path from a root to a leaf has labels of the form 
{ViiiVn: ■ ■ ■ f ?/*s) vertices 2/7 € Wi- satisfying H < *2 < • • • < Let Tt be the final tree, i.e., 
t is the minimum integer for which Tt = T+i. We may assume that Ti C T 2 C • • • C T. 

Define 7 ' = 2 ^‘^ 7 ^‘^. Let v be the probability measure on maps ip : V{H') V{G') induced 

by our random embedding algorithm. For a tree Tj and its node a, let Ua denote the probability 
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measure obtained from v by neutralizing I (a). We claim that for each j > 0, 


IE„ 




2d 


aGV(T’j_i) a^Lij 


where for j = 0 we let T_i be the empty graph. The root node r is the unique node in Tq and has 
weight a{r) = 1. Moreover, and 12 have identical distribution since I{r) = Wk+i and we defined 
the map from Wk+i to 14_|_i to be a random uniform map. Therefore, the inequality above holds 
for j = 0 (in fact equality holds). 

Suppose that the claim holds for some j > 0. For a vertex a & Lj, let r(a) be the set of 
children of a in Tj+i and let t'o be the probability measure obtained from Va by neutralizing 
the vertices in N~^{a). If r(a) = 0, then iV+(ya) C 1(a) and thus 'Kjj^[uj(ya;ip)‘^^] < y 2 d < ^Ad- 
Otherwise if r(a) 7 ^ 0, then for all b € r(a), we have z^o = ^'b- Therefore, by Lemma 14.41 we have 


< 7 ' ' IEi/o[w( 2 /a; + 


1 

2^ 


ber(a) 


(3) 


Since the images of N~^(a) are neutralized in the measure uq, by definition we have [uj(ya] < 
fdid- Since 7 ' > 1, we will use the same bound ([3]) for nodes a € Lj having r(o) = 0 with the 
understanding that the second term equals zero for such nodes. Therefore, by the inductive 
hypothesis and the fact |r(a)| = |T(a)| < d for all a € Lj, we have 


E,, 




< cr{a)E^^[uj{ya-,^)‘^'^] 

aGV(T^_i) a^Lj 


< 


'=^( 0 ) 7 ^ 4 ^ + 

aGV(T^_i) aSLj 


7 ■L4d + 


1 


2|F(a)| 


^ E^^[uj{yb-,'ilj) 


2d] 


feer(a) 


= X] ^(«) 7 V 4 d+ ^ a(a)E^^[uj{ya-,'tp)‘^‘^], 

aGV {Tj) aE-Lj-j-i 


thus proving the claim. 

Since Tt = Tt+i, in the end we see that E^ [u;(a:; < '^aeV{Tt) L 4 d- For each 

j > 0, let Uj C V(Tt) be the set of nodes of Tt that are at distance j from the root node. Since 
'^(®) = Z]6er(a) 2(7(6) holds for every non-leaf node a G V(Tt), a simple recursive argument implies 
YlaGU- 2-^< 7 ( 0 ) < 1, or equivalently Yla^u — b- Since Tt is a finite tree, this 

implies 

'"W£EE‘’('^)sE2'' = 2. 

aeV(Ti) j>0 aGUj j>0 



Therefore, 


E„ 


u}(x;^p) 


2d 


- o-(a)7V4d < 2^‘^+V^'^/^4d- 

a&ViTt) 


□ 
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Lemma [^31 can be used to control sums of the form Ylx&Wi appear in Lemma [33l 

Using it, we can prove Theorem 14.21 which gives a quantifiable condition which will guarantee the 
existence of an embedding. It is equivalent to the following form since we defined 9^ = |14| when 
adding the artificial set Wk+i at the beginning of this subsection. 


Theorem 14.21 If 9i > 2|ILi| for all i € [k], then the probability that V' does not induce an 
embedding of LI in G is at most 

Proof. Dehne A* = for each i G [k]. If [k]W ^ h then we have Aj < ^9i 

for all i G [A:], and therefore by Lemma 14.31 if induces an embedding of H in G. Note that 
Afc = 0 since uj{x\'if) = 0 for all x G Wk- Thus, ^l) may not induce an embedding of LI in G 


only if 
most 2E 
Therefore, E [A*] < \V 


Ai 

e[fc] ei 

Ai 

ie[fe—1] 9i 


- Eie[fc- - 1 ] ^ > 2 ' Markov’s inequality the probability of this event is at 
By Le mm a l431 for all x G V{H), we have E [a;(x; 


22d+l„,2d 


7 fJ.4d, and 


2E 


E Ai 

^G[/c—1] 


< 2 • ^ 2^'’*+ 
1] 


ll^*L,2d,. ^ r,2d+2,,2d,, 1^* 

-7 < 2 ^7 /24d 2^ 

ie[fc—1] * 


□ 


5 Proof of the main theorem 


In this section, we prove Theorems 11.11 and 11.21 We first prove the bipartite case. Theorem 11.21 
for which a density-type embedding result holds and the proof is slightly more simple. We then 
prove our main theorem, Theorem ll.il in Subsection 15.21 

The following lemma is a slight variant of Lemma 12.21 for non-bipartite graphs. Since we need 
to add further conditions to the conclusion, instead of stating the outcome of dependent random 
choice in terms of a particular set, we state it in terms of the expected value of the random 
variables of interest. We omit its proof since it follows from the proof of Lemma 12.21 after making 
straightforward modifications. 

Lemma 5.1. Let d, s and t he natural numbers satisfying t > s, and rj, a be positive real numbers. 
Let G be a graph with two sets Vi, V2 C U(G) satisfying e(Vi, V2) > «|Ui||U2|- Let X. be a t-tuple 
in U/ chosen uniformly at random and let A = A'(X; 1 / 2 )- Then for all 9 < 7 q;'’*|Vi|, 


E[A'’*] > and E 


QeA'i 




In particular, this implies the existence of a set A such that |A| > ^a*|V 2 | and /Us(A 2 ;Ul) < 2rf. 


The outline of the proof in both cases, bipartite and non-bipartite, are the same. We first 
prepare the host graph by repeatedly applying Lemma [5T] to find a collection of r sets 
that have small average defect towards each other. This framework was first developed by Kos- 
tochka and Sudakov in [26j and its variants have been used in much subsequent work [laiiniisT]. 
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Although the statement we need easily follows from the proof of Fox and Sudakov’s application 
[16| . we cannot use their work as a blackbox since we must control higher moments of the defect 
function. The following proposition summarizes the main observation underlying this framework. 


Proposition 5.2. Let G be a graph with sets Vi, A 2 C V{G). Let iK be a t-tuple in chosen 
uniformly at random and let Ai = A^(X; Vi). Then 


E 


^s,e(A2; Ai) 




Proof. For all Q G A 2 , we have A((5; Ai) = A(QuX;Vi). Therefore, a;6)(<5; Ai)^ = a; 0 (QUX; Vi)^, 
where we consider Q U X as a (d + t)-tuple. Hence, 

E [ueiQ] Ai)^] = E [ueiQ U X; Fi)^] = ^ ue{Q U T; Fi)^ • P(X = F). 

Since P(X = F) = -^ 4 ^, it follows that 


E 


Ai^ 


= E 




we((5;Ai 


Q&A<^ 


IA2I 


d+t 


Y, a;,(QUF;Fir. 


Q&A^Y&A\ 


The last expression equals /is,e(A 2 ^*; Fi) by definition. 


□ 


By repeatedly applying Proposition 15.21 following techniques developed in previous works, we 
can obtain a collection of r sets that have small average defect towards each other. Once 

we are given such a collection, we use the following lemma producing sets {vY'\ that are needed 
for the embedding scheme of Section 01 Its proof follows from standard probabilistic methods but 
is rather technical and thus will be given separately in another section. For an r-tuple of sets 
{Aj}jg[^], we define = Ui'e[r]\{i} each j € [r]. 

Lemma 5.3. Let k,d,s,r be fixed natural numbers satisfying r > 2, s > 4d, and e,e' be fixed 
positive real numbers. Let m be a natural number satisfying m > , 

and 9 be a natural number satisfying 9 > em. Let pi for i G [k] be positive real numbers satisfying 
Ylie[k]Pi — ^ Pi — for all i G [k]. Suppose that {Ajj^gj^] are vertex subsets with 

em < I A, I < m, satisfying pLsfi{A'f-] Aj) < ^ for all j G [r]. Define 9i = ^Pi9 for all i G [k]. 
Then there exist sets {vY}{i,j)e[k]x[r] satisfying the following conditions: 

(i) For all i G [fc] and j G [r], we have ^ Aj and |f/A| < pi\Aj\. 

(a) For all {{i,j), (hJi), ■ , {idJd)) S {[k] x [r])‘^+^ satisfying ji, ■■■ Gdfi^ j, 

PsA ( n <max{£', 8 /e-Ve(Al^.;A,)}. 

\ae[d] / 

Moreover, if r = 2, then the factor r^£-'^ can be replaced by 1. 
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(in) For all {i,j) ^ {i',j') the sets and ^ are disjoint. 

We would like to note that the improvement for r = 2 in item [ii) plays an important role in 
the proof of Theorem 15.51 as it allows us to prove a single exponential bound for bipartite graphs 
(as opposed to the double exponential bound for general graphs). 


5.1 Bipartite graphs 

We first find a pair of sets {Ai,A 2 ) for which the d-tuples in Af have small average defect into 
A 2 , and vice versa for d-tuples in ^ 2 . This will be achieved by applying Lemma l5.II twice. First 
application will give a set A 2 such that the d-tuples in A 2 have small average defect into V{G). 
We then apply the lemma again by choosing a random set X C A 2 , to obtain a set Ai whose 
d-tuples have small average defect into A 2 . Proposition 15.21 can be used to show that this pair of 
sets has the claimed properties. 


Lemma 5.4. Let m,d,s,t be natural numbers and a,rj be positive real numbers satisfying t > s 
and rj < . Let G be a bipartite graph of minimum degree at least am with vertex partition 

Vi U V 2 where |Vi| = IV 2 I = "m. Then there exist sets Ai C Vi and A 2 C V 2 satisfying the following 
properties: 


(i) l^il > for both i = 1,2, 

(ii) for all 6 < yisfi{Af\A 2 ) < 2?]*/^ and < 2r/*/^. 

Proof. Since ujqi < uig holds for all 6' < 6, it suffices to consider the case when 6 = 
Throughout the proof, we will consider uj and [i with this fixed value of 0 and hence will omit 0 
from the subscripts. Since 0 < r]a'^~^^\Vi\, we can apply Lemma [5.11 with = d + t to find a set 
A 2 C V 2 of size 1^421 > ^a*|V 2 | such that ^ 1 ) — 

By the minimum degree condition on G, we know that the subgraph of G induced on Vi U ^42 
has density at least a. Let X be a t-tuple in A 2 chosen uniformly at random and let Ai = A(X). 
Since 6 < r]a'^\A 2 \, Lemma [5T] with (Fi)[ 57 i] = ^2 and (P2)[5T]~ implies 


E 


lAil'" 


1 

V 


QeAf 




By Proposition 15.21 it follows that E[/is(A 2 ; Ai)] = Pi) < 2 t/*. Since 


E 


QeAf 


277V2 ‘ 




1 

- 4" 


dt 


i^ir 


Let A be a particular choice of X for which the random variable on the left-hand side becomes at 
least as large as its expected value, and let Ai be Ai for this choice of X. First, |Ai|'^ > 
implies |Ai| > ia*|Pi|. Second, |Ail'^—^ a;((5; A 2 )* > 0 implies A 2 ) < 277 *. Third, 

lAil"*- ^^/U^(A^;Ai) > 0 implies /rs(Af;Ai) < 2rf/‘^. □ 
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The proof of the bipartite case of the Burr-Erdos conjecture follows by combining Lemma 15.41 
with Lemma [ 5.81 and then using the embedding scheme from Section HI Even though the theorem 
below is stated for graphs of large minimum degree, the density-embedding theorem result follows 
since every graph on n vertices of density at least a contains a subgraph of minimum degree at 
least ^an. 

Theorem 5.5. There exists a positive real number c such that for every natural number d and 
positive real number a, the following holds for all m satisfying m > If G is a graph on 

m vertices with minimum degree at least am, then it is universal for the family of d-degenerate 
bipartite graphs on at most d~^2~‘^^a'^‘^^m vertices. 

Proof. Define n = Let Lf be a d-degenerate bipartite graph on at most n vertices. 

By Lemma [ 3.11 there exists a vertex partition V{H) = U satisfying the following 

properties: 


(i) k < log 2 n, 

(ii) for all (i,j) G [k] x [2], we have < 2“*+^n, 

(iii) both Uie[A:] Uie[fc] independent sets, and 

(iv) for all (i, j) G [A;] x [2], each vertex v G has at most 4d neighbors in Ui'>i U wP. 

Define t = s = 32d and rj = Define 9 = We can use the concentration 

of hyper geometric distribution (see, for example, [22]) to find a partition Vi U V2 of V{G) for 
which I El I = IV 2 I = Y bipartite subgraph induced on Ei U V 2 has minimum degree at 

least Since rj < and 9 = ^r} ■ • y, we can apply 

Lemma Em with = 4d and to obtain sets ^ C Ei and B Q V 2 satisfying 

the following properties: 


(i) 1^1, |B| > j 2 E( 36 d)Q,tm ^ a,nd 

(ii) < 2 r]^/‘^ and pis,e{B‘^'^; A) < 277*/^. 

Define pi = c2~'^A^^d) c is a positive constant defined so that Yli&[k]Pi ~ Then 


c = 



1 

- 2-V(80<i) 


_ 1 _ 2-l/(80d) ^ 2 


/ ln2 (ln2)2 \ ^ 1 

V 80 d 2 ( 80 d) 2 y - 160 d‘ 


Thus, for all f G [/c], we have Pi > Pk = c2~^A^^d.) > ^2“ i°g2 "■/(sod) _ ^^.^^-1/(80^) > _^_j.j-i/(80d)_ 
We can apply Lemma [531 to the pair of sets {A, B) with = 2 , . ^5^ = s, t |5^ = 4 d, £ [5^ = 
^ = ^770;^'^+*, (£^) [5(^ = 1677*/^, and Pi as above since s > 16 d, pi > and |^|,|B| > 

^a^m > 2^00(i67yV2j-80(i^Q,4d-i-7)-80d^i60d(4^)407j.40g8, g^ygj^ g Jargg enough (recall that 

m > a~'^'^^). This provides partitions A = Uie[fc]^* ^ ~ Ui£[k]^i satisfying the following 
conditions for 9 i = \pi 9 : 


(i) for all i G [k], |74j| < pi\A\ and |Bj| < Pi\B\, 
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(ii) for all ij,ii,---,Ud) S , fJ^s,9j (Uaelid] ^j) < max{16r/‘/2,^)} < 

16 ? 7 */^, and 

(iii) for all {j,ii, ■ ■ ■ ,Hd) S (OaeKd] < max{16r/*/^ 5)} < 

We now apply the embedding scheme described in Section 01 For each i G [k], we will map 
to Ai and to Bi. We proceed by choosing one pair (i,j) G [k] x [2] at a time, following the 
lexicographical ordering on [A:] x [2] in the reverse order. For each set its corresponding 

defect parameter used in the embedding scheme is 9i = \pi9, and therefore 


7 < max 
ie[fc] 


max{|^j|, \Bi\} 
9i 


< max 
ie[fe] 


Pim/2 


2m 

~T 


16 




T]a 


By the properties above, the maximum average s-th moment defect ps is at most Vorfl"^. For each 
(A, j) G [A:] X [2], we have 

^ 2-*+in _ 2-^+^n ^ I0240dn ^ d2^^n _ 1 

9i ~ (ryQ;^'^+*m)/8 ~ ~ 2*/2 

Therefore, 


22(4d)+2 EE 


ie[fc]ie[ 2 ] 






ie[fc] 


16 


2^2 ^a4d+t 


%d 


IQrj 


i/2 


ie[fe] 


2^2 


f^32d+4 t/2—Sd —8(i(4d+t) 

* ^ ij Ct 


240d+9 . ^8d^-288d2 

240d+9 . /^36(i2-7\8d^-288(i2 ^ ^-led+Q ^ ^ 


Hence, by Theorem 14.21 we can find a copy of H in G. 


□ 


5.2 General graphs 

For general graphs, we prove the lemma corresponding to Lemma 15.41 in two steps. In the first 
step we hnd sets Hi, ^ 2 , • • • ,Ar that have small average defect in one fixed direction. 

Lemma 5.6. Let d, s, t, r he non-negative integers satisfying t > s and g be a positive real number. 
In every edge two-coloring of the complete graph Kn with red and blue, in the red graph or the 
blue graph, there exist sets Hi C • • • C satisfying the following conditions: 

(i) \Aj\ > for all j G [r], and 

(ii) for all 9 < n, ps,e{A^] E 27/* for all j <r — 1. 
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Proof. Define ^ 2 (r-i) = and arbitrarily color it with one of the colors. For i G [2(r — 1)], 

suppose that we constructed a set Ai of size at least There exists a color, say Cj, 

of density at least ^ in the set Ai. Since 9 < rj 2 ~'^\Ai\, we may apply Lemma ET] to the subgraph 
induced on Ai with the edges of color c* to hnd a set C Ai of size |j4j_i| > 2~^~^\Ai\ such 
that Ai) < 2 rf in the graph consisting of the edges of color c*. Color the set Ai-i with 

the color that we used. 

Repeat the process to find sets ^ 2 r -2 2 ^ 2 r -3 D • • • D Aq. Note that |^o| > 

By the pigeonhole principle, we can find r indices ii < ■ ■ ■ < ir for which Ai. are all colored by 
the same color, say red. These sets satisfy Property (i). Since Hs,e{Q;X) < fis,e{Q',Y) holds for 
all sets X D Y, we have ^s,e{Af^_.^; Ai^) < ^s,e{Af^_.^; Ai^_.^^i) < 2rf in the red graph for all 
a G [2,r]. Thus, the sets ^* 2 )''' satisfy the claimed properties. □ 

Given the sets ^i, • ■ ■ ,Ar constructed in the previous lemma, we run r more rounds of de¬ 
pendent random choice to produce an r-tuple of sets that have small average defect towards each 
other. At the i-th round, we will choose a random tj-tuple Xj G A** and update each set Aj for 
j ^ i to X(X) n Aj. This will ensure that the average defect of A’f- into Ai is small. The next 
lemma shows that all the conditions are maintained throughout this process. 

Lemma 5.7. Let d,r,s, and t be non-negative integers satisfying t > s. Let ^ and 9 be positive 
real numbers satisfying ^ and 9 < In every edge two-coloring of the complete 

graph Kn with red and blue, in the red graph or the blue graph, there exist sets Aj for j G [r] 
satisfying the following properties: 

(i) \Aj\ > 9 for all j G [r], and 

(ii) UspiXtj-, Aj) < If for all j G [r]. 

Proof. For i = 0,1, ■ • • , r, define ti = -|- t) and di = d-\- ij- Note that gtj > di > 

tj+i for all z G [r — 1]. Define = 2“^°*°^ and note that ^ < ^o- Define 9o = ^n, 9i = ^9o = 
and note that it suffices to prove the statement for 9 = 9i. 

We may apply Lemma ES with = do, = 0, ^tM] = *o, ^Eg] = WM = 

since 9i < This gives sets Bi C ■ ■ ■ C. B,. in, without loss of generality, the 

red graph, satisfying \Bj\ > for all j G [r], and ^ 2(2“^®*°'')*° < for 

all j G [r — 1]. Let Aqj = Bj for all j G [r]. For each z = 0,1, • • • , r, we will iteratively construct 
sets satisfying the following properties: 

(a) \Aij\ > 9 q for all j G [z,r], 

(b) |Ajj| > 9i for all j < i, 

(c) no^ 0 f,{A'^y, Aij+i) < for all j G [z,r - 1], 

(d) for all j < i, 

(e) Aij C Aij+i for all j > i. 
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Note that the properties hold for i = 0 (the only relevant properties are (a),(c) and (e)). In the 
end, the sets Aj = Arj for j € [r] satisfy Properties (i) and (ii) by Properties (b) and (d) since 

dr = d, 9i = and tr = 8{d + t)> 2t. 

Suppose that for some z > 1, we have constructed sets for which the properties 

hold. For simplicity, we abuse notation and write Bj = Ai-ij for all j G [r]. Let X G be a 
tj-tuple chosen uniformly at random. For each j ^ i, define Aj = Bj nN(X.) and define A* = Bi. 
There are several events that we consider. 

Event 1. |Aj| > 0q for all j > i and |Aj| > 6i for all j < i. 

Note that the claim trivially holds for Aj since |Aj| = \Bi\ > 9 q. For j > i, since Bi C Bj-i, 

P (|A,| < 9o) = f,o,g,iBf_,-, Bj) 

— 9-o,9o{Bj"_i; Bj) = ^ Bj). 

Since dj_i > U, by Proposition [33] and Property (c), we have i^o^g^{Bf_^-, Bj) < fio^g^{Bf_.^; Bj) < 
Hence, the probability that |Aj| < 9o is at most ^-L+L-i/2 < Similarly, for j < i, 

since Bi C B-j, 

P (|A,| < 9i) = fioMBl-.B,) < iJ.oMB%-,Bj) 

By Property (d), this is at most ^-2ii+L-i/2 < Since there are r total events, the proba¬ 

bility of Event 1 is at least 1 — 

Event 2. Aj+i) < for all j G [i,r — 1]. 

For j G [z, r — 1], since Bi C Bj, we have 

E[jio,eo{Bf;Aj+i)]=-^ Y, fio,eoiBf;N{Q)nBj+i) 

\-^i\ ^ f 
QEB/ 

fio,e,{Bf-,NiQ) n Bj+,) < Aj+,)], 

where ly is the probability distribution obtained by choosing X uniformly in the set H** instead 

of By Proposition [53] and Property (c), E,,[/io,eo(-Sj'*L A^+i)] < ^ 

Since < ^ii_i/4^ j^y Markov’s inequality, the probability of Event 2 for a fixed index 

j is less than 1 — Since there are at most r choices for the index j, the probability of Event 2 
is greater than |. 


B,\Y 1 
mj 
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Event 3. ; Aj) < 4r^*i-i/4 for all j < i . 

As in Event 2, we have -^j)] < ] Bj) < (by Property (d)). 

Therefore, by Markov’s inequality, the probability of Event 3 for a fixed j is less than 1 — 
Since there are at most r choices for j, the probability of Event 3 is greater than 


Event 4. (Q; 


Note that E 


EggA'^t ^01 {Q'^ BiY 


< ^eYQ'^ BiY • P (Q e ■ Since 61 = ^Oq < 


^\B,\, if 0JeAQ-,Bi) Y 0, then \N{Q-Bi)\ = 

( \N{Q-,Bi)\ 


— I 

^ 


< 






< 




^ Therefore, E 


^ ^ = 




Hence, with probability greater than we have En Hj)® < 


Therefore, with positive probability all four events hold. Let A be a particular choice of X for 
which all four events hold, and define Aij = Aj for this choice of X. Event 1 immediately implies 
Properties (a) and (b). Since \Aij\ > 9i > C‘^\Bj\ holds for all j G [r], it follows by Event 2 that 
for all j G [i, r — 1], 


Ai+i) < Ai+i) < • 4re 


':—2di 


di . 


c—‘^di 


^ ^til2 


implying Property (c). Similarly, Event 3 implies fig ^gj^{AfYj', Aj) < ,fL/2 foj. j < i. Finally, 
Event 4 implies 



^ W0i(Q; < 

Q^4:-i 


t-2di 


\B-iY 


- ueYQiBiY 


Q&Bt\ 


proving Property (d). Property (e) follows from the same property the previous step, and the 
definition Ajj = N{X) n Ai_ij for j / f. □ 

We conclude this section with the proof of our main theorem, Theorem 11.11 

Theorem 11.11 There exists a constant c such that the following holds for every natural number 
d, r, and n satisfying n > . For every edge two-coloring of the complete graph on at least 

2 ^ 2 '=’’vertices, one of the colors is universal for the family of d-degenerate r-colorable graphs on 
at most n vertices. 

Proof. Define m = for a large enough constant c. Suppose that we are given an edge 

coloring of Km with two colors red and blue. Dehne t = s = 32d and ^ Define 

9 = fYm. Lemma 15.71 with = 4d implies that in the red graph or the blue graph, there exist 

sets {Aj}jg[^] satisfying the following properties: 

(i) l^jl > 9 for all j G [r], and 

(ii) ^XsfiiA^Y^j^Aj) < ff. 
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Define G as the subgraph of consisting of the edges of the color realizing the two properties 
above. 

Let H he a d-degenerate r-colorable graph on at most n vertices. By Lemma l3.11 there exists 
a vertex partition V{H) = U(i j)e[fc]x[r] satisfying the following properties: 

(i) k < log 2 n, 

(ii) for all (i,j) G [/c] x [r], we have 

(hi) for all j € [r], the set Uje[fc] is an independent set, and 

(iv) for all (i,j) G [/c] x [r], each vertex v G has at most 4d neighbors in Ui'>i j'e[r] 

Define pi = where c' is a positive constant dehned so that Yli£[k]Pi ~ Then 


d = 


> 




_ 1 _ 2-l/(80d) ^ ^ 


(. In 2 (In 2)^ \ 
V 80d 2(80d)2 ) 


1 

“ 160d' 


Thus, for all i G [/c], we have Pi> Pk = cf 2 ~^d^ 0 d) > ^> 2 - iog 2 n/ (sod) _ ^/^-i/( 80 d) > _^^-i/( 80 (i) ^ 
Since pi > s > 16d, and m > n > if c is large enough, then we can apply 

Lemma [5.31 to G with c| 5 ] 3 ] = 4d, (g^) [ 5 ^ = aiid sets {Aj}jg[^], to obtain sets 

{^i^^}(i,j)e[k]x[r] satisfying the following conditions for 6 i = ^PiO: 


(i) for all (i, j) G [A:] x [r], < Pi\Aj\, and 

(ii) for all • • • , {kdjid)) G {[k] x [r])‘^"*+^ satisfying ji, ■ ■ ■ , j 4 d 7^ j, 


Psfii 





< max 




t/2 


a.r.4<^C—4d,, /'/14c 

8r 4 Ps,9[A_ 


id . 

i’ 




)} 


We now apply the embedding scheme described in Section 01 We choose one pair (z, j) G [A:] x [r] 
at a time, following the lexicographical ordering of [A:] x [r] in the reverse order, and map 
to . For each set Wf , its corresponding defect parameter used in the embedding scheme is 
6 i = ^PiO, and therefore. 


2 r 

e' 


Moreover, by Property (ii) above, the maximum average s-th moment defect satisfies ps < 
For all (z,j) G [A;] x [r], since m > > 640rd^“^n for large enough c, 

2“*+^n ^ 640rdre ^ 1 ^ 

Oi ~ Tc'2-V(8Od)0 - 2^2 . ^2^ - ^ 

Therefore, for large enough c, 


28d+2^8d^l^^ ^ ^ 

ie[k] je[r] 


Oi 
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Hence, by Theorem 14.21 we can find a monochromatic copy of H. 


□ 


6 Random pruning 


In this section, we prove Lemma 15.31 the final ingredient of the proof. One of our main tools is 
the following concentration inequality (see [291 Theorem 3.1]). 


Theorem 6.1. Let X = (Xi,X 2 , 


, Xn) be a family of independent random variables with Xi 
satisfies |/(x) — f{y)\ < Ci whenever the vectors x and y differ only in the i-th coordinate. Then 


taking values in a set for each i. Suppose that the real-valued function f defined on Ojeln] 


/(X) -E[/(X)] > 


The next lemma shows that given the collection of sets obtained in Lemmas 15.41 and 15.61 we 
can further impose that the defect is not concentrated too much on individual vertices. This 
additional condition will help us later when taking a random partition. 


Lemma 6.2. Let e be a fixed positive real number, and d,s,r be fixed natural numbers satisfying 
s > Ad. Let m be a natural number satisfying m > and 6 be a natural number 

satisfying 6 > em. Let Ai,A 2 , ■ ■ ■ ,Ar be (not necessarily disjoint) vertex subsets satisfying em < 
\Ai\ < m and fis,e{A'Li', Ai) < 1 for all i € [r]. Then there exist subsets Bi C Ai for all i E [r] 
satisfying the following properties: 

(i) \Bi\ > 2~^f^‘^^'l\Ai\ for alii E [r], 

(a) Bi) < 2ps,e{A'Li; Ai) for all i E [r], and 

(Hi) for every i E [r] and v E B_i, ^ 

Proof. We will fix 9 throughout the proof and thus for simplicity will use the notation toe = to 
and Hsfi = Hs- Let Ri C be the set of vertices v E such that > 

|^_.|d-5/8^ Then 

\Ri\ • \A_iY-^/^ < ^ ^ uj{Q -< d • |H_i| < d\A_i\Y 

•v&Ri Q:veQ&AP 

Therefore, |i?j| < d\A-i\^^^ < drwf^^. For each i E [r], let Bi be the set obtained from Ai by 
removing the vertices in Ujg[r] Then \Bi I > \Ai\ — > 2 and (i) holds. 

Fix i E [r]. Since s > Ad, by Proposition 13.61 all d-tuples in Af^ have at least . ^.^,4 > 

{rrrYfl^ — common neighbors in Ai. Since Bi is obtained from Ai by removing at most 

dr^^ui^/s vertices, all d-tuples Q E Bf^ satisfy 

/ dr'^m^!'^ \ ( dr'^m^!'^ \ 

'*•>1j " IPw) 

/ j^9/4 \ 

> \N{Q;Ai)\>2-^/(^^^\N{Q;Ai)\. 
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Hence, uj{Q\Bi) < Ai). Therefore, 




BM 


Y. u:{Q;B,y<j^^ Y 2y^u{Q-A,y 


Q&sy 


Q&Ay 


= 2^/2 


\BM 


1^-* 


co{Q-,Aiy\ <2^,yA-^■,A^)■ 


Q&Ad 


This proves (ii). Moreover, since Bi = Ai\Ri, each vertex v ^ Bi satishes Ylg-vegeA'^ -^i)^ < 

Therefore, 


X] oj{Q-,B,y < Y 2i/2u;(Q;H*)* < 2V2|^_.|<^-5/8 < 2|B_i|'^-5/8, 

Q:v&Q&Bti Q.v&QeBti 


proving (iii). □ 

Lemma [6.21 prepares an r-tuple of sets with small average defect towards each other by further 
imposing that the defect is not concentrated too much on individual vertices. We now prove 
Lemma 15.31 bv taking a random partition and showing that the defect is well-distributed. 

Lemma 15.31 Let k, d, s, r be fixed natural numbers satisfying r > 2, s > 4d, and e, e' be fixed 
positive real numbers. Let m be a natural number satisfying m > 2200^£'^-80g-80(ij,i60(i^40^40g8^ 
and 0 be a natural number satisfying 9 > em. Let pi for i G [k] be positive real numbers satisfying 
Ylie[k]Pi — 1 Pi — for all i G [k]. Suppose that are vertex subsets with 

em < \Aj\ < m, satisfying pspiA'^j; Aj ) < ^ for all j G [r]. Dehne 6 i = ^PiO for all i G[k]. Then 

there exist sets {^/'^^}(i,j)e[fc]x[r] satisfying the following conditions: 

(i) For all i G [k] and j G [r], we have ^ Aj and <Pi\Aj\. 

(ii) For all ((f, j), • • • , {id,jd)) G {[k] x [r])''+^ satisfying ji, • • ■ Jd^ j, 

( n <max{£',8/e-V,0(Hl^.;H,)}. 

\ae[d] / 

Moreover, if r = 2, then the factor can be replaced by 1. 

(iii) For all (i,j) y the sets and ^ are disjoint. 

Proof. Note that the given condition on {pi}i^[k] implies k < By Lemma 16.21 we can 

find subsets Bj C Aj for j G [r] satisfying the following conditions: 

(a) \Bj\ > 2“^/(2rf)|^^.| fQp all j G [r], 

(b) ps,e{B^j-,Bj) < 2ps,e{Atj\ Aj) < 1 for all j G [r], and 

(c) for every j G [r] and v G B^j, we have Ylq-.v&Q&B'i ^e{Q]Bjy < 2|i?_j|'^“®/^. 
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Define qi = ^pi for i € [k]. Color the vertices with [k] x [r] randomly where each vertex receives 
color {i,j) with probability qi and the outcome for each vertex is independent. For each (i,j) G 
[k] X [r], let C Bj be the set of vertices of color Let Ei be the event that for all 

{i,j) € [k] X [r] and all d-tuples Q G we have \N{Q-,V^^'^)\ > ^qi\N{Q-, Bj)\. Let E 2 be the 
event that for all (ii, Ji), • • • , {id,jd)) G ([/c] x [r])^+^ satisfying ji, ■■■ j, 



where if r = 2, then we replace the factor r'^e by 1. Let E 3 be the event that for all (i,j) G 

[k] X [r], we have < 2qi\Bj\. 

We first show that Properties (i), (ii), (hi) follows conditioned on the events Ei,E2 and 

E3. Property (iii) immediately follows from definition of the sets. Property (i) follows from E^ 

and \Bj\ < \Aj\ for j G [r]. For some j G [r], fix a d-tuple Q G B^y Since \N{Q]Vl^'^)\ > 

}2qi\N{Q] Bj)\ holds by Ei, if \N{Q]Bj)\ > 6, then we have > \qi 9 . Therefore, if 

u} 0 {Q-,Bj) = 0, then ujq.{Q;V}^^) = 0. Otherwise, if u} 0 {Q;Bj) / 0, then u} 0 {Q;Bj) = > 

- = uj 0 ^{Q-,V^^^). Therefore, u} 0 .{Q;Vp'^) < (jj 0 {Q;Bj) in both cases. This implies that 

2|A^(Q;V^ )| 

for all ■ ■ , {u,jd)) £ {[k] x [r])''+\ we have by E 2 , 







Therefore, by (b), we have Property (ii) (similarly Property (ii) holds for r = 2 as well). 

It thus suffices to show that the intersection of Ei,E 2 , E^ has non-zero probability. To compute 
the probability of Ei, note that for {i,j) G [k] x [r], since s > 4d and Bj) < 1, by 

Proposition 13.61 all d-tuples Q G B^^j- have at least common 

neighbors in Bj. Hence, for a fixed d-tuple Q G B'iy 


E 


mQ-,vl''’)\ 


qi\N{Q;Bj)\>qiSr 


Therefore, by Theorem 16.11 the probability that \N{Q-, < \qi\N{Q-., Bi)\ for a fixed Q G B*^- 

and (i,j) G [A:] x [r] is at most Since there are at most 

choices for Q G B'^p there are at most rk ■ {rm)^ such events. By the union bound, 

P(L^D < 

Note that by Theorem 16.11 


f^2-^/(2d)q.iBjl < < 2qi\Bj\^ > l-2e-2(i-2 


_r) —el{&4:d?E)m}/'^ 


> l-2e 


> 1 - 


3{kr)^+^ ■ 
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There are (fer)'^+^ choices for • • • , iid,jd)) G {[k] x and thus PiE^) > 

To compute the probability of E 2 , fix • • • , {id,jd)) S {[k] x To simplify 

notation, define Q = nae[d] and B = Oaeld] Note that 


E 


Qea 


^u;g(Q;B,r-P(Q€Q). 

Q&B 


( 4 ) 


Fix Q G B. If all vertices in Q are distinct, then P((5 € Q) = Ilaelrfl^^a- Otherwise, if dB 
is the set of d-tuples in B where not all vertices are distinct, then by Proposition 13.51 we have 


'^Q^dB^e{Q',BjY < 2em^ Y^Q&B^e{Q] BjY < 2em YjQ&Bi-^diQ] Bj) 
Property (b) and \B-j\ < rm. Thus, in (j^, 


d[d-l) 


< e 


-1 


cPr'^m'^ 1 , ]^y 


E 


^ ^ ^diQ'-i Bj 
Qea 


<e ' Y^^s(Q',BjY 

\ae[d] / QeB 

= + ( n I \B\f^s,8iB;Bj). 

\ae[d] / 


For each vertex v G B_j, by (c), we know that YIq v&q&b ^diQ'i BjY ^ 2\B_jY~^/^. Therefore, 
the random variable Y^QeQ^diQj BjY can change by at most 2 \B-jY~^^^ < 2r'^mY~^l^ if we 
change the outcome of a single vertex. Since Oaelrf] ^ by Theorem 16.11 the 

probability that the random variable is greater than A := [Uaeld] Qia) \B\ max ^2fis,e{B-, Bj), 
is at most = 2 e- 0 ')V( 2 r- 2 ^m 2 'i-i/ 4 )^ 


A' := A - E 


Since 


J2MQ;Bj 

Qea 


> A - max I 3e ^r'^d^rrY \ | ( H ) \B\d^s,e{B] Bj) i > ^A. 


i ae[rf] 


A = I n 9*- J l^l™ax|2/x^,0(^;5j),|-| > (^r V(iorf)^ (2 = 


^ ^ ^ d- 1/10 


J^ae[rf] 

the above mentioned probability is at most 




m 




< 


4 (/j^)d+l ' 


Since there are at most (/cr)'^+^ choices of indices, we see that YYq^q^YQ'-, BjY < A holds for 
all choices, with probability at least |. Recall that E 3 holds with probability greater than |. We 
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show that-B 2 holds if both these events hold. For afixed (ii,ji), • • • ,{id,jd)) £ 

following the notation above, event implies that ^Oaeld] ^ Therefore, 

^e{Q] BjY < A = I 1 1^1 max 1 Bj), 

QeQ \ae[d] / 

<2^/2|Q|max|2/is_e(e;Sj),|-| < \Q\uiasL^2^/‘^ Bj),e'^ . 

Note that if r = 2, then B = B^j and thus ^s,e{B]Bj) = Bj). Therefore, we have 

E 3 . If r 7 ^ 2, then since all sets Ba for a € [r] have size between and m, we see that 

\B-j\ < 2^/(^'^Ve“^|i?a| for all a Y j- Therefore, \B‘!ij\ < 2^/‘^{r£~Y'^\B\, and 

= ^ E ^eiQ', BjY = Bj). 

Q&B Q&B<i. 

Thus, E 2 holds with probability greater than A. We proved that there is non-zero probability 
that all Ei,E 2 ,E^ hold, and as seen above, this proves the lemma. □ 

7 Concluding remarks 

Original form of the Burr-Erdos conjecture. The definition of Ramsey numbers can be 
extended to pairs of graphs. For a pair of graphs Hi and H 2 , the Ramsey number of the pair 
{Hi, H 2 ), denoted r{Hi, H 2 ) is the minimum integer n such that in every edge coloring of with 
two colors red and blue, there exists a red copy of Hi or a blue copy of H 2 . The arboricity of a 
graph is the minimum number of forests into which its edge set can be partitioned. The original 
conjecture of Burr and Erdos [3j can be stated as follows: 

Conjecture 7.1. For every natural number d, there exists a eonstant c such that for every pair 
of graphs Hi and H 2 , eaeh having arboricity at most d, r{Hi,H 2 ) < c(|E(Ffi)| -|- \V{H 2 )\). 

It is well-known that arboricity and degeneracy are within a factor two of each other and 
hence Theorem 11.11 implies this conjecture. Moreover, our proof can be extended to more than 
two colors. 

Determining the constant. For graphs with fixed chromatic number, the constant Cd we 
found is exponential in d, which is best possible up to the constant in the exponent. For general 
degenerate graphs, the constant Cd we obtained is double-exponential in d, and it still remains 
to understand the correct behavior of this constant. The corresponding question for bounded 
degree graphs is reasonably well-understood since Conlon, Fox, and Sudakov PEI IS] proved 
r{G) < for all graphs of degree at most A, and r{G) < c''^|R(G)| for all r-colorable 

graphs of degree at most A. These bounds are close to being best possible since Graham, Rodl, and 


31 



Rucinski proved that there are bipartite graphs of maximum degree A having r{G) > c^|R(G)| 
(for some different constant c). Moreover, the author |28] proved that a transference principle 
holds for bounded degree graphs, and thus if G has a ‘simple’ structure, then the bound on the 
constant can be significantly improved. For example, if there exists a homomorphism / from G 
to a graph H having maximum degree at most d where = o(n) for each v € V{H), then 

r{G) < Hence, in this case the constant does not grow together with the maximum 

degree of G. 

Further applications of the technique. We used a random greedy embedding algorithm 
together with dependent random choice. Theorem 14.21 shows that our embedding algorithm suc¬ 
ceeds with probability greater than i. Hence, a careful analysis will show that there are in fact 
many copies of the graph of interest. It would be interesting to find further applications of these 
methods. For instance, by using a variant of the proof of Lemma 15.71 as in m together with the 
embedding methods developed in this paper, one can show that a weak version of the blow-up 
lemma holds for degenerate graphs. Namely, for all d, 5 and sufficiently large n, there exist e 
and c such that if are disjoint vertex subsets each having size at least n and {Vi,Vj) 

are (e, 5)-dense for each distinct i,j, then it contains as subgraphs all d-degenerate r-colorable 
graphs with at most c^n vertices. One might be able to further develop this approach as in 
and extend the bandwidth theorem of Bottcher, Schacht, and Taraz [3] to degenerate graphs of 
sublinear bandwidth. 

Related problems. As observed by Burr and Erdos, graphs with at least (1 -|- e)nlogn edges 
have Ramsey numbers superlinear in the number of vertices. On the other hand, they showed 
that there are graphs with ere log re edges for some constant c that have Ramsey numbers linear in 
the number of vertices. It would be interesting to further classify the graphs that have Ramsey 
number linear in terms of its number of vertices. An interesting test case is the class of hypercubes 
Qn, for which we slightly improved the previous best known bound to r{Qn) = (1 -|- o„,(l))2^"'. 
Burr and Erdos conjectured that there exists a constant c such that r{Qn) < c2"' holds for all 
natural numbers re. 

In a similar direction, there has been much work aimed at understanding the Ramsey number 
of a graph in terms of its number of edges. The most notable result in this direction was proved 
by Sudakov [33], who confirmed a conjecture of Erdos and Graham by showing that r{H) < 
2 Cy/m j^olds for all graphs H with m edges. Also, Conlon, Eox, and Sudakov have an interesting 
conjecture [TO] Conjecture 2.16], asking whether log(r(L7)) = 0(d(R)-|-log re) holds for all re-vertex 
graphs H, where d{H) is the degeneracy of H. 

Acknowledgements. I thank David Conlon, Jacob Eox, and Benny Sudakov for their valuable 
remarks. I also thank the anonymous referee for carefully reading the manuscript and making 
several useful comments. 
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